Approximate Representations 
and Approximate Homomorphisms 



Cristopher Moore 
Computer Science Department 
University of New Mexico 
and the Santa Fe Institute 
moore@cs . unm . edu 



Alexander Russell 
Computer Science and Engineering 
University of Connecticut 
acr@cse . uconn . edu 



October 1, 2010 



Abstract 

Approximate algebraic structures play a defining role in arithmetic combinatorics and have 
found remarkable applications to basic questions in number theory and pseudorandomness. 
Here we study approximate representations of finite groups: functions xp : G — > such that 
Pr[ip(xy) = tp(x) ip(y)] is large, or more generally E™ \\xp(xy) — ip(x) xp(y)\\ 2 is small, where 
x, y are uniformly random elements of the group G and denotes the unitary group of degree 
d. We bound these quantities in terms of the ratio d / d m in where d m i n is the dimension of the 
smallest nontrivial representation of G. As an application, we bound the extent to which a 
function f : G — »■ H can be an approximate homomorphism where H is another finite group. 
We show that if H's representations are significantly smaller than G's, no such / can be much 
more homomorphic than a random function. 

We interpret these results as showing that if G is quasirandom, that is, if d m \ n is large, then 
G cannot be embedded in a small number of dimensions, or in a less-quasirandom group, 
without significant distortion of G's multiplicative structure. We also prove that our bounds 
are tight by showing that minors of genuine representations and their polar decompositions 
are essentially optimal approximate representations. 

In additive combinatorics and number theory, an approximate subgroup of a group G is a subset 
H which is roughly closed under multiplication: that is, such that Pr X/J/ \xy G H] is large where x, y 
are uniformly random elements of H. We focus on approximate group representations — functions xp 
from G to U^, the group of d x d unitary matrices, such that xp acts roughly like a homomorphism. 
We then use our results to bound the existence of approximate homomorphisms from G to another 
finite group H. 

Let G be a finite group and let xp : G — > U^,. If ip(xy) = ip(x) ip(y) for all x,y E G, then we 
call xp a representation. We are interested in understanding how close xp can be to a representation 
if G does not in fact have any (^-dimensional representations — in particular, in the case where G 
is quasirandom [1J in the sense that its smallest nontrivial representation has dimension d m m > dy. 

We can measure the extent to which xp fails to act as a representation by the expected £2 distance 
between xp(xy) and xp(x) xp{y), where x and y are chosen uniformly from G. To control the trivial 
case where xp{x) = 1 for all x, we assume that E z xp(x) is bounded in its operator norm. We also 




assume that the expected Frobenius norm squared of xp{x) is cL,, which holds, for example, if each 
xp{x) is unitary. 

Our main theorem asserts that the expected I2 distance is bounded below by a function of 
the ratio d^/d m in- Roughly speaking, if we think of xp as a low-dimensional embedding of G, we 
cannot avoid a certain amount of "distortion" of G's multiplicative structure. We let || A\\ denote 

the operator norm, and let || A || F = tr A f A denote the Frobenius norm. 

Theorem 1. Let Gbe a group and let d m in denote the dimension of G's smallest nontrivial irrep. For any 
function xp : G — > , 

x E G \\ip(xy)-xp(x)xp(y)\\l>2d ip ^1 - ||IE t/,(x)|| 3 op - | . (1) 

If A and B are random unitary matrices of dimension dy distributed according to Haar measure, 
then E^g \\A — B|| F = 2d. Thus Theorem [TJ shows that when d^/d m in and ||E X t/?(x)|| o p are small, 
xp is little better than a random function from G to U<f as far as acting like a representation is 
concerned. 

We comment that Theorem[l]holds in the more general setting where xp is a function from G to 
the group G of invertible ^-dimensional matrices, as long as xp is "unitary in expectation" in 
the sense that 

Exp(x) f xp(x) = 1. (2) 

In the regime where xp is very close to a representation, our work is related to Babai, Friedl, 
and Lukacs EUSJ. They showed that if \\xp(xy) — xp(x) xp(y)\\ ¥ is sufficiently small, then there is 
a genuine representation p with d p = dy such that p is close to xp. Their definitions are slightly 
different; for instance, they consider uniform bounds on ||t/?(xy) — xp(x) xp(y) || F rather than its ex- 
pectation over all pairs of elements x, y G G, and also place a bound on||t/?(l) — 1 1| F . Nevertheless, 
the Fourier-analytic proof of Theorem [T] uses similar Fourier analytic techniques as in their work. 

Theorem [l] yields the following corollary, bounding the probability that xp{xy) = xp{x) xp{y) for 
uniformly random x, y: 

Corollary 1. Let G, d m [ n , and xp : G — > Ud f be as in Theorem \l\lfx,y£ G are uniformly random, then 
Pr[xp(xy)=xp(x)xp(y)\ < 1 ^1 + ||E?(x)||^ 

When dxh I d m i n is small, this is tight for a random function xp that sends half the elements of G to 1 
and the other half to —1, where each half is chosen uniformly at random from all subsets of size 
|G|/2. 

As an application of these results, we consider approximate homomorphisms / : G — > H 
where H is another finite group, bounding the probability that f(xy) = f(x) f(y) for uniformly 
random pairs x, y G G. To avoid the trivial homomorphism f(x) = 1, we require that f's image is 
close to uniform. For each y G H define the probability 

Pf(y) = P v r [/( x ) = y\ 
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that a uniformly random x £ G has image y. Then we bound the £2 distance between p * and the 
uniform distribution u(y) = 1/\H\, requiring that 



\pf- u \\ 2 = E 

yen 



Pf(y) ~ 4t 



2 e 



For instance, this holds with e = 1 if p<-(y) is uniform on a subgroup of H of index 2. 

We will use the fact that if / is an approximate homomorphism then, for each irrep a of H, the 
composition a o f is an approximate representation of G. Our first bound focuses on one a at a 
time. 

Theorem 2. Lei G and H be finite groups, and let d mm denote the dimension of G's smallest nontrivial 
irrep. Let f : G — > H such that 10 holds. Then 



Pr[/(xy) = /(*) f(y)) < \ rma f 1 + + 



wfoere cr ranges over all ofH's nontrivial irreps. 

If is perfectly uniform so that e = 0, this expression is minimized by H's smallest nontrivial 
irrep a. In that case, / cannot act very homomorphically if H is much less quasirandom than G is. 

Our second bound considers all of H's irreps, not just the smallest one. Recall that the Plancherel 
measure assigns each irrep a G H the probability P(o~) = d\l \H\. If denotes the expectation of 
Vda/dmm or 1, whichever is smaller, 

«H(*-o-E|f*(^i)- w 

then we have the following. 

Theorem 3. Let G and H be finite groups, and let d m i n denote the dimension of G's smallest nontrivial 
irrep. Let f : G — > H such that 10 holds. Then 

Yr[f{xy)=f{x)f{y)} < \\p f \\\ + R H {d min ) = i±£ + R H (d min ) . 

If Rh and e are small, i.e., if most of H's irreps are much smaller than dmin and f's image is close 
to uniform, Theorem [3] shows that / cannot act like a homomorphism much more often than a 
random function from G to H. 

Finally, we give two results indicating that the bounds of Theorem[T]are essentially tight. First 
we show that they are achieved exactly if xp is proportional to a minor of a genuine irreducible 
representation. While these minors are not unitary, we can scale them so that they are unitary in 
expectation in the sense of l(2]). 

Theorem 4. Let p : G — > U(V) be an irreducible representation of G of dimension d p and let II : V — > V 
be a projection operator onto a dy-dimensional subspace WQV.Ifwe define 

ip(x) = J-fUp{x)U, 

V dtp 
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then E x tp(x) = 0, E x = EE, and 




E \\f(xy)-ip(x)tp(y)\\l = 2d tp 

x,y£G 

Note that this precisely matches our upper bound in Theorem[TJin the case E T tp(x) = 0. 

In our last result, we use the polar decomposition to make these approximate representations 
unitary. This comes at some cost to the expected Frobenius norm, but there is still a regime for 
I dp where we can achieve significantly stronger results than those of a random function. First 
recall that if A is a d-dimensional complex matrix of full rank, its polar decomposition expresses A 
as the product of a unitary matrix A and a positive semidefinite matrix 

A = AP, 

where 

A = A(A f A)- 1/2 and P={A f A) 1/2 . 

That is, A = AP^ 1 where P is the unique positive semidefinite matrix such that P 2 = A* A. It is 
a simple exercise to show that A is unitary. More importantly, A is the unitary matrix which is 
closest to A in £2 distance |7|. 

Then we have the following theorem. Note that unlike Theorem HI we now assume that IT is 
chosen uniformly. Specifically, given a fixed projection operator IT' of rank dy, we set IT = U f Yl'U 
where U G U(V) is uniform according to the Haar measure. 

Theorem 5. Let p : G — > U(V) bean irreducible representation of G of dimension d p and let IT : V — > V 
be a projection operator onto a subspace W C V chosen uniformly from all projection operators of rank dy. 
Let ip{x) be defined as in Theorem® and let ip(x) be the unitary part of its polar decomposition. Then 

^ x f eG U{xy) -$(x)ip(y)\\ 2 v < 2d 4 , (l - 

It follows that there exists a particular projection operator Tl satisfying the bound above. 

The difference between Theorems |4] and |5] is the cost of making tp(x) unitary — it comes from 
bounding the expected £2 distance between ip(x) and ip{x) and using the triangle inequality. While 
it is intuitive that this cost is nonzero, we have not attempted to optimize this bound. Nevertheless, 
even this relatively crude bound shows that there exist unitary approximate representations that 
perform noticeably better than random matrices — that is, for which 

1 ~ 2 

— E \\rp{xy)-xp(x)tp{y)\\ F <a 




for some a < 1, whenever 



i^/dp > 31 */ 58 = 0.876 .. . 



18 

We conjecture that approximate representations exist with a. < 1 whenever dy/dp > 0. 
Proofs are given in the following three sections. 
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1 Bounds on approximate representations 

In this section we prove Theorem[T]and Corollary[T] In the process, we set our conventions for the 
nonabelian Fourier transform, and prove several inequalities that we will apply later on. 

Proof of Theorem [lj For any A, B we have 

||A-B||p = tr(A-B) + (A-B) = || A\\l + ||B||J - 2Retr A f B . 

Since z = xy is uniformly random whenever x and y are, 

E \\ip(xy) - xp(x) xp(y)\\l = E \\ip(z)\\* + E \\ip(x) ip(y)\\l — 2Re E tr </>(xy) + jp(x) </>(y) . (5) 

x t y z x,y x t y 

If tp is unitary in expectation, (0 implies 

E\\ip(z)\\l = tiEip(z) i ip(z)=trl = (L,, (6) 

2 Z T 

and of course this holds identically if xp is unitary. Similarly, 

E U(x)tp(y)\\l = tr E rp(y) f ip(x) f ip(x)ip(y) 

x,y x,y 

= tr(E^(x)>(x)) (Eip{y)ip{y) f ^ = ixt = d f . 

Then ((5]) becomes 



E \\xp(xy) - ip(x) xp{y)\\l = 2d f (l — Re E tr ip f (xy) ip(x) ip{y) 



x,y f \ dip x,y 

Thus we will focus on estimating the expected trace 



(7) 



Vtrxp f (xy)ip(x)xp(y). (8) 

x,y 

Note that if xp is a genuine representation, xp f (xy) ip(x) tp(y) = 1 and this trace is identically dy. 

We rely on nonabelian Fourier analysis, for which we refer the reader to |4J. In order to estab- 
lish our notation and choice of normalizations, let / : G — > C and let p : G — > be an irreducible 
unitary representation of G or "irrep" for short, and let G denote the set of irreps of G. We adopt 
the Fourier transform 

in which case we have the Fourier inversion formula 



f(x)=Zd p tr(f( P )p(x) 

peG 

The Fourier transform preserves inner products in the sense that 



(/i?)=E/W*«W = \G\Ed p ti(f(p)*g(pj) . (9) 

X p 
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In particular, we have Plancherel's identity, 



\G\L d p\\f(P) 



(10) 



Since xp is a matrix-valued function, each entry i/>(x)j- has its own Fourier transform. Therefore, 
we may treat the Fourier transform tp as a tensor with four indices, 

£(([))$ = Ei/>(x)j / o + (>)* or $(p)=E f(x)®p f (x) . 

The Fourier inversion formula can then be expressed as a partial trace. We adopt the Einstein 
summation convention, where any index appearing twice is automatically summed over. For 
instance, (AB)j = A\B* and tr A = A*. Then 



Plancherel's identity becomes 

E||#0 



<K*)) = £^?(p)$p(*)I- 
p 



^ 



P P i,j,k,l 

We compute the trace © by evaluating it in the Fourier basis. First write 

trtp f (xy)xp{x)xp{y) = i/> + (xy))t/>(x) ; fe t/>(y)f 

= £ d p d,d r ^\py^(ai^(x)% P {xy)la{x)lr{y)l 



p,cr,T£G 

= Zjpdrdr ^{p)% $(cr% d e {p(T)Y gP {x)\ p(y) c a <r{x)\ x{yf f 

p,<7,T£G 



l_\5 be 5 cd a = p* 
i p \0 a^p*. 



Schur's lemma implies 

E [p(x)l<r(x) dl 
Thus taking the expectation over x and y turns ((12)) into 

E tr q\xy) ip(x) ip(y) = £d p ^{p)% ^{p*)%6 he 8 cd ^ 6 aj 



x,y 



= j:d P r( P y j i{p(p*) i "J(p*)^. 



We rearrange this slightly, writing ip(p^)% for the partial transpose ^p(p*)%- Then 

{p{p f ) =E[ip(x)®p(x)] , 

and 

x,y 



Etr xp f (xy) xp(x) xp(y) = £d p f^p)f b f (p%f {p')f a . 



(11) 



(12) 



(13) 



(14) 
(15) 
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If we view i/>(jO + ) as a linear operator on C d <8> C f, then 



E 

X 



tp\x) ®p f {x) 



and we can write 



Etvip f (xy)f(x)f(y)=£d p trf(p f ) ({p(p f )) . 



(16) 



We remark that in the case where xp is an irrep the only irrep contributing to the sum ((16)) is ip* 
since, as in ((13)) , we have 



ip(p')fi = E rp(x))p(x) 



J_(6 lk 6 je p = tp\ 
\ p £ ip* . 



Let IT denote the operator (1/dy) 5 lk 5u. Diagrammatically, n is proportional to the "cupcap." 
It is a one-dimensional projection operator, equal to the outer product of the vector 



i 

with itself, where e, denotes the zth basis vector. Since II is Hermitian, ([TBI implies that 

E tr xp f (xy) xp(x) ip(y) = dy trLT 3 = dy trll = dy 

x,y 

which holds since ip f (xy) ip(x) tp(y) = 1. 

Returning to ([TBI , since A + A is positive for any A we have 



trAA f A < ||A|L„ lUllS • 



o P 



This gives 



E trt/> + (xi/) t/>(x) 0(y) < ||</V)|| op 



tM|2 



(17) 



We separate out the term corresponding to the trivial representation p = 1, for which i/>(l) 
E X V(*)- Since II a IIf < d ll A llo P for 

any rf-dimensional matrix A, we have 



Etr ip f (xy)ip(x)ip(y) < ||E<K*)|| op ||E </>(*)) ||* + £ d p \\$(p f )\\ ov ||^(P + ) 



op 



.+ MI 2 

If 



(18) 



< <K*)|lo P + (max||f(p + )|| op ) £d p \\{p(p+)\\ 

= <Mlf <K x )llo P + (^ll^(p + )llop)f IkWIlp 



rf V ( II? ^Wllop + maxH^p 



op 



(19) 
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where we used Plancherel's identity ((TTj) in the third line and the fact that E z ||i/>(x)|L = dy is 
unitary, or unitary in expectation, in the fourth. This is analogous to the Fourier-analytic treatment 
of the Blum-Luby-Rubinfeld linearity test OH]. 

Our next goal is to bound the operator norm of tp(p^). Let V and W denote the spaces on which 
tp and p act, respectively. Then \\ip(p )\\ is the maximum, taken over all vectors u G V <8> W of 

norm 1, of (u, ip(p f ) u\. Using the Schmidt decomposition we can write 

U = OCiVi ® Wi 
i 

where {v{\ and {iVi} are orthogonal bases for V and a d^, -dimensional subspace of W respectively, 
and where | =1. Then separating the tensor product and using Cauchy-Schwarz gives 

u,tp(p f ) u\ = Y^oc*ocj (vi ® Wi,f(p f ) Vj ® Wj S 

hi 

= Y^0C*Kj (vi (g) W ir (E [lp(x) (g> p(x)]j Vj ® Wj 

hi x 



= E [(v it f(x)vj) (w if p(x)wj) 



^ E a **/V (f (^'<K X ) v i) ) (f ( w i'P( x ) w i) 



By Schur 's lemma we have E* (wi,p(x) Wj} 2 = l/d p , giving 



y a p i,j V 

Another application of Cauchy-Schwarz and the fact that \\ip(x) ||p = cL, gives 

1 



u,xl>(p f ) u) < 



Ehfhf lE^^M^T 



EN 2 ) f 



/dp v 



? 11^)111 




dp ' 



(20) 



where we again used the fact that ip(x) is unitarity, or unitary in expectation, in the fourth line. 
Combining |(T9)) with ((20")) and using our hypothesis that min^i d p = d m i n then gives 



V tr ^ (xy) ip(x) xp(y) <dy \\Eip(x)\\ 3 + 



Finally, combining this with (0 completes the proof. 



(21) 
□ 
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Proof of CorollaryHJ For any A, B e U rf with A/Bwe have \\A - B\\l < 4d. Thus 
E U(xy) - ip(x) ip(y) || F < M f P#(xy) ^ xp(x) ip(y)} , 

and so 

Pr[xp(xy) = ip(x) xp(ij)} < 1 - -L E ||^(xy) - </;(x) </>(J/)Hf • 
Combining this with the bound |Q} completes the proof. □ 

2 Approximate homomorphisms 

In this section we prove Theorems|2]and|3l bounding the extent to which a function from one finite 
group to another can act like a homomorphism. 

Proof of Theorem O Let a be an irreducible representation of H. We treat xp a = a o f as an ap- 
proximate representation of G of dimension d a . To bound ||E X ip(x) || , note that 

ft/vM = E p/d/My) = E(p/-")(i/) c7 (y) = l H l 0/ -")(»• 

yeH yeH 

where we used the fact that E y c(y) = 0. Then we have 

llf^Wllop < H?lM*)||? < \H\ 2 \\(fpu)(a)\\l < H || P/ - M |g < ^, 
where we used Plancherel's identity ((TO) in the third inequality. Thus 

llffrWlU^^- (22) 

and applying Corollary [T] completes the proof. □ 

Proof of Theorem [3j We let R = YL a &d rfu^u denote the regular representation of H. As above, for 
an irreducible representation a of H we define t/v = c o f. Observe that 

Pr[f(xy) =/(x)/(y)] = r^mR(f(xy)- 1 f(x)f(y))] = £ A- E [^(/(xy)" 1 /^) f(y))] 

II' aG fj I I 

= E ^E [tr^(xy)^(x)^(y))] 



< + E ^||E^(x)|| op ||E^(x)g+ E \ 1 



V „ ' 

(*) 
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the last inequality following from equation ((18)) . Focusing on the term (*) above, 

E 4||f ^(x)|| ||e m*)\\1 < E 4llf Mx)\\l 

||E,R(/(x))||"-l 



Hence 



|H| 

tr {E x E y R(f(x)-i)R(f(y))) 1 
|H| |ff | 

= Pr[f W =/( y )] * . 



pr^y) =/w/(y>] < w w = /(y)] + E iir min \\hr~- 1 ) 

<W(x)=/(y)]+R H . 

Finally, since 

PrLfW = /(y)] = E p/W 2 = IIp/II 2 = IN 2 + lb/ - "II 2 < w ' 

the statement of the theorem follows. □ 



3 Minors of representations and their polar decompositions 

In this section we prove Theorems H] and |5l 

Proof of Theorem® Let p : G — > U(V) be an irreducible representation of G of dimension d p and 
let n : V — > V be a projection operator of rank cL,. Treating the image of II as a subspace W, we 
consider the function xp : G — > End(W) given by 

^(x) = J-^-Up{x)U. 
y "if 

Then 

Eip(x) = w-^n (Ep(x)) n = o. 

Moreover, since p is irreducible, Schur's lemma gives 



rk n „ dy 



Ep( X yn P (x) =1 = ^1. (23) 

x Up Up 



Thus 

Eip(x) f xp(x) = ^EYIp(x) f Up{x)Yl = -fU (e p(x) f Tlp(x)) U = Yl. 

x dip x city V x J 
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Since IT is the identity on the subspace W, ip(x) is unitary in expectation. 
As in the proof of Theorem [TJ we then have 

E \\xp(xy) - ip(x) ip(y)\\l = 2d f (l - ±- Re E tr xp\xy) rp(x) ip(y) 

x,y y u^p x,y 

Since p is a genuine representation, p{xy) = p(x) p(y) and 

( d \ 3/2 

E tri/7 + (xy) \p(x) tp(y) = / E trUp\xy)n P (x)np(y)n 

x,y \ a tp / X 'V 

3/2 



= / E trUp(y) f p(x) f Up(x)np(y)n 

\Uf ) x,y 



^ \ 3/2 
di/> / 



Ep(y)np(y) + )E(p(x) + np(x))n 



(24) 



'^trn = d T/M / : 7 



Combining this with (|24|) completes the proof. 



□ 



Proof of Theorem\5\ The squared £2 distance between a matrix A and the unitary part of its polar 
decomposition, A = A(A + A)~ 1,/2 , is 



\A — A\ 



A — A{A i A) 



-1/2 



tr A + A - 2tr(A + A) 1/2 + d = £(A - l) 2 . 



Here A ranges over the singular values of A, i.e., the square roots of the eigenvalues of A f A. For 
any A > we have 

(A-l) 2 < (A-1) 2 (A + 1) 2 = (A 2 -l) 2 . (25) 
Thus the distance between A and A is at most the distance between A f A and the identity, 

2 



A-A|| 2 <£(A 2 -1) 2 = A' A-l 



(26) 



Let tp(x) = ^JdpJd^,Ilp{x)T\. Since TI is the identity on the subspace W, the rest of our proof 
consists of bounding 

E \\ip(x) — $(*)IIf < E t/;(x)>(x)-n 

n,.r n,x 



E 

n,x 

E 

ru 



xp(x) f ip(x) 



(27) 



where in the last line we used the fact, proved in Theorem HI that xp is unitary in expectation. We 
will then use the triangle inequality to bound 

EE ||^(xy)-^(x)^(y)|| 2 . 
11 x,y 
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We write 

2 



E 



xp( X yxp(x) 2 = \ tr E \llp i (x)Up{x)n P i (x)TIp(x)] . (28) 



We can view this trace as a contraction of two tensors. One is p <8> p + <S> p ® p + - Since we can take 
the expectation over all n by conjugating a particular n by a random unitary U G Urf p/ the other 
is the "twirl" of TI 04 , namely 

Y = E(!I f nU) 04 . 

Since Y commutes with the diagonal action of U(d p ), it is a member of the commutant, and hence 
an element of the group algebra C [S4] . Thus we can write 

Y = v(n) ■ n, 

neSi 

where we identify each n G S4 with its action on y® 4 . Moreover, since Y commutes with any 
7r G S4 the coefficients v(n) form a class function: v is constant on each conjugacy class and lies in 
the linear span of the characters of S4. 

We can compute the coefficients v(n) as follows. For any permutation a G S4, we have 

trT(7 = trn V = 4 (t7) / 

where c(a) is the number of cycles in a. For any A G S4, the inner product of the character Xa with 
the function d c ^ is 

[xx,d c{ - ) ) = T{Kd) l 



where T(A,d) denotes the number of semistandard tableaux of shape A and content in {1, . . . ,d}. 
The multiplicity of A in (C d ) 04 is also T(A, d), so taking traces gives 



v ( n ) = E d AAA(7T 



T(A, dp ) • 

A somewhat lengthy calculation gives the following coefficients for each of the five conjugacy 
classes in S4: 



" ((12)(34)) ^©) 2 ( 1 -^) 2+0 ^ 4 » 
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Note that v(n) scales as d p f ^ where t(n) = 4 — c(tz) is the transposition distance, i.e., the mini- 
mum number of transpositions whose product gives x. 

When 71 is the identity, one of the pairs of transpositions (12)(34), and four of the 3-cycles (123), 
we get p(x) p f (x) p(x) p + (x) = 1, contributing d p to the trace. Two of the six transpositions (12) 
contract with p (g) p f (g) p (g) p f to give p(x) p f (x) (g) p(x) p f (x) = 1 g) 1, which has trace d 2 p . These 
are the leading terms, and we get 



E tr 



Ylp f (x)Ylp(x)Up\x)Ylp(x 
= (u(l) + u((12)(34)) +4u((123))) d p + 2v((12)) d 

= dp (jf) ( 2 " + + ° tKer t6rmS/ 



0(d p 1 ) + other terms 



where here and in the sequel O(-) refers to the limit d p — > oo while d^l d p stays constant. 

To bound the other terms, let m denote the total multiplicity of irreducible representations 
appearing in the decomposition of p <g> p* . Then 



I 2 |2 

m and E \x P {x ) <tn. 



The first of these follows from Schur's lemma, since for any irrep T we have E xe c |/Ct(*)| 2 
The second follows from the Frobenius-Schur indicator, which for any irrep r is 



1. 



xeG 



+1 if t is real, 

if t is complex, 

— 1 if t is quaternionic. 



The other terms include contractions such as {pp A p) <8> p + = p <S> p + , (ft0 + ) <8) p ® |0 + = 1 
and so on. These terms scale as 

\i> U' ) 1 2 



(4 U ((12))+4u((1234))) E|^ p (x) 



o(^- 1 ; 



4y((123))d p E|^(x)|=0(d 



w((1234)) Elxpix)^ = 0{d p 3 m) 



u((1234)) E 1^(^)1* = 0(rf- 3 m 



2u((12)(34)) Ex p (x 2 )%(*) 2 = 0(d p 2 ) A /E |^(^ 2 )T E = 0(d p 2 m) . 



If G is quasirandom, with d^xm the dimension of its smallest nontrivial irrep, then 

1 



m < 1 



(29) 



since p (g> p* contains exactly one copy of the trivial irrep. Even if we content ourselves with the 
generous bound m < d 2 , the largest error term in (|29)) is 0(1). Thus 



E tr 



np + (x)n,o(x)np + (x)np(x) 



dA-r 
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Combining this with (|28|) gives 



xp(x) i xp(x) 



and so j27l ) gives 



E ||^(x)-^(x)||J<rf v 



+ 0(1) 



(30) 



(31) 
(32) 
(33) 



Finally, we return to our task of bounding 

E\\ip(xy)-ip(x)ip( y )\\l. 
For this purpose, we use the triangle inequality and write 

\\ip{xy) - i/S(x) #(y)|| p < \\${xy) - tp(xy)\\ v 

+ WW -$( x )Hy)h 

+ \\xp(x)xp(y)-^(x)xp(y)\\ F . 

In expectation, the squares of the terms appearing in d3Tb and d32l are precisely the topic of d30l 
and Theorem^ respectively. As for the quantity (|33]) , we may further expand it as 

\\ip(x) ip(y) - xp(x) ^(y)|| F < ||^(x) ^(y) - f (y)|| p + ||^(x) f (y) - ^(y)|| P 

= II (vW - #(*))?(y)|| F + WW (V(y) - <Ky)) ll F 
= II (VW - <K*))<Ky)|| F + llv(y) - £(y)ll F / 

where in the last line we used the unitarity of t/;(x). Squaring both sides and using the inequality 

(a + b + c + d) 2 < 4(a 2 + b 2 + c 2 + ci 2 ) gives 



ll<K*y) - <Ky)ll F < 4 (JI<K*y) - <K*y)ll F 
+ \\H*y) - H*) Hy)\\l 

+ \\{ip(x)-$(x))tp(y)\\l 

+ H(y)-ip(y)\\l) ■ 

When we take the expectation over y, the third term simplifies since i/>(y) is unitary in expectation: 

(xp(x) - t/5(x)) + (e rp(y) f rp(y)^j (f(x) - #(x)) 
(rp(x) - ip(x)) f (ip(x) - xp(x)) 



E||ty(x)-#(x))tf(y)|| F = tr 



tr 



= U(x)-xp(x) 



IF • 



Putting this together, taking expectations over IT, x, and y, using the fact that xy is uniformly 
random, and applying IpD) and Theorem H] gives 

E \\$(xy) - <£(*) {p(y)\\l < 4 (} E ||^(xy) - </;(x) <Ky)|| F + 3 E \\y{x) - t/S(x)|| F 



<2iJ4 1 



+ 6 1 



completing the proof. 



□ 
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There are a number of ways one might improve Theorem|5j The bound ([25]) , and therefore ((26|) , 
is off by a factor of (A + l) 2 4 when A is close to 1, i.e., when ip(x) is close to unitary. Us- 
ing the triangle inequality is also rather crude. With more thought one should be able to bound 
(l/^jEnM ||i/?(ri/) — ip{x) tp(y)\\ F with a smaller function of the ratio d^/dp, and thus achieve 
good approximate representations in lower dimensions. 

Acknowledgments We benefited from the lectures of Avi Wigderson and Ben Green at the Bel- 
lairs Research Institute of McGill University and from conversations with Jon Yard about twirled 
operators. This work was supported by the NSF under grants CCF-0829931, 0835735, and 0829917, 
and by the NSA under contract W911NF-04-R-0009. 

References 

[1] W. T. Gowers. Quasirandom groups. Combinatorics, Probability and Computing 17(3):363- 
387,2008. 

[2] Laszlo Babai and Katalin Friedl. Approximate Representation Theory of Finite Groups. Proc. 
32nd Symp. Foundations of Computer Science, 733-742, 1991. 

[3] Laszlo Babai, Katalin Friedl and Andras Lukacs. Near representations of finite groups, 
Manuscript, 2003. 

[4] Jean-Pierre Serre. Linear Representations of Finite Groups. Number 42 in Graduate Texts in 
Mathematics. Springer- Verlag, 1977. 

[5] Manuel Blum, Michael Luby, and Ronitt Rubinfeld. Self-testing/ correcting with applications 
to numerical problems. Journal of Computer and System Sciences, 47(3): 549-595, 1993. 

[6] Mihir Bellare, Don Coppersmith, Johan Hastad, Marcos A. Kiwi, and Madhu Sudan. Lin- 
earity testing in characteristic two. IEEE Transactions on Information Theory, 42(6):1781-1795, 
1996. 

[7] K. Fan and A. J. Hoffman, Some metric inequalities in the space of matrices. Proc. Amer. Math. 
Soc. 6 (1955) 111-116. 



15 



